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[57] ABSTRACT 

A learning method for performing a method of inductively 
learning rules from machine data, and a method for analyz- 
ing an inference process to learn rules useful for increasing 
efficiency by a common program are provided. Data desired 
to be classified is converted into a graph and is then input. 
The course of process by a computer is also converted into 
a graph and is then input. Then, patterns appearing in the 
input graph are extracted and patterns having a good evalu- 
ation result are output. The patterns are converted into rules 
for classification and rules for high-speed operation in 
accordance with a kind of the input graph, and are output. 

8 Claims, 7 Drawing Sheets 
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FIG . I 



ALGORITHM CLIP (G[ n , C,L,W) 

INPUTS Gin: COLORED DIGRAPH 
C- EVALUATION METHOD 
L,W: SEARCH PARAMETER 
OUTPUT W- COMBINATION OF 0 OR MORE PATTERNS 
CONTAINED IN Gin 



Bnext— Bnext U PATTERN MODIFICATION 

(Gtmp, Vtmp) 

FOR EACH Vtmpl^ B TO DO 
FOR EACH Vtmp26 B T0 00 
IF Vtmp|*v t mp2 THEN 
Vnew — Vtmpl L)Vtmp2 
ADD Vnew TO Bnext 
B< SELECT W VIEWS FROM Bnext ON THE BASE 
OF C 

Vl<SELECT BEST VIEW FROM Bnext ON THE BASE OF 
C 

RETURN Vt 



PROCEDURE PATTERN MODIFICATION (G,V) 
INPUTS G : COLORED DIGRAPH 

V^ COMBINATION OF 0 OR MORE PATTERNS 
OUTPUT B: COMBINATION OF 0 OR MORE VIEWS 
BEGIN 

B — 0 

FOR TEMPORARY PATTERNS P IN G TO DO 
Vnew — VU {INVERSELY CONTRACTED PATTERN OF P> 
ADD Vnew TO B 

RETURN B 



BEGIN 



Vo — 0 i B — {Vo} ; I I 
REPEAT L DO 
Bnext— 0 

FOR EACH Vtmp€ B TO 
Gtmp < GRAPH OF Gm CON 
CONTAINED IN Vtmp 




END 



END 
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FIG . 3 
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FIG . 7A 
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FIG. 9 
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METHOD FOR LEARNING DATA converted into a graph and is then input. Patterns appearing 

PROCESSING RULES FROM GRAPH in the input graph are then extracted and patterns having 

INFORMATION good evaluation results are output. A pattern is converted 

into a rule for classification or a rule for high-speed opera- 

This application is a continuation of application Ser. No. 5 tion in accordance with a kind of the input graph and is then 

08/164,564 filed on Dec. 10, 1993, now abandoned, which output, so that two kinds of machine learning works which 

is a Continuation-In-Part of application Ser. No. 07/872,140 could not be processed heretofore by the same method can 

filed Apr. 22, 1992, now U.S. Pat. No. 5,353,384. be implemented by the same method. 

BACKGROUND OF THE INVENTION 1Q BRIEF DESCRIPTION OF THE DRAWINGS 

1. Field of the Invention FIG. 1 shows a process procedure of an embodiment of 
The present invention relates to preparation of a knowl- ^ p resent invention- 

edge base indispensable to an expert system, and more pjQ 2 shows the course of graph processes selected as 

particularly to a leannng system for proving to the expert fof ^ the ^ 

system a mechanism for automatically preparing the knowl- is ** 

edge base which has been made by a person heretofore. FIG - 3 * a schemata diagram illustrating a method of 

2. Description of the Related Art converting data desired to be classified into a graph; 
Heretofore, a machine learning method has been studied FIG - 4 * a "hematic diagram illustrating a method of 

by being broadly divided into two methods including a ^ converting a pattern into a rule for classification; 

method of learning a rule from data inductively and a 20 FIG. 5 is a schematic diagram illustrating a method of 

method of analyzing an inference process to leam a rule for converting the course of the process by a computer into a 

increasing efficiency. graph; 

As an example of the former, J. R. Quintan, "Induction of FIG. 6 is a schematic diagram illustrating a method of 

Decision Trees", Machine Learning, pp. 81106, 1986 25 converting a pattern into a rule for highspeed operation; 

describes a method of inductively learning a rule from data. FIGS. 7 A and 7B show collected examples of programs 

As an example of the latter, Tom M. Mitchell, Richard M. used by operation of the computer user in order of operation; 

Keller and Smadar T Kedar-Cabelli, "Explanation-Based FIGS. 8A and 8B show graphed examples of the programs 

Generalization: A Unifying View" Machine Learning Vol. 1, collected in order of operation; and 

No. 1, pp. 47-80, 1986 describes a method of efficiently 30 pjQ 9 shows ^ operation supporting system for imple- 

solving a similar example after learning by storing an ment i n g the process procedure of the present invention, 
application series of knowledge necessary for solution of a 

specific problem. DESCRIPTION OF THE PREFERRED 

Further, as a method utilizing the algorithm similar to the EMBODIMENT 

present invention, "Concept Learning from Inference Pat- 35 ijCi . . ■ j *u a 

tern" by Kenichi Yoshida and Hiroshi Motoda, Journal of A* embodiment of the present invention is now described 

Artificial Intelligence Society of Japan, pp. 119-129, July ^ reference t0 the accompanying drawings. 

1992 has been proposed for the purpose of attaining the FIG. 1 shows a basic algorithm of the present invention, 

object similar to the latter. Data desired to be classified and the course of a process by 

The prior art has a problem that separate programs for 40 a computer are converted into a graph Gin and are then input 

performing the two methods are required when both func- to the computer. The conversion method is described later, 

tions are necessary in a computer. n ™> P atterns appearing in the input graph Gin are extracted 

For example, when there is considered an operation by the algorithm shown in FIG. 1 and a combination (view) 

supporting system for a computer which analyzes operation of patterns having good evaluation results on the basis of an 

instructions 'given halfway by the user and infers the sub- 45 evaluation method C input separately is extracted. Finally, 

sequent operation to perform the operation automatically, the extracted patterns are converted into rules for classffi- 

the pure inductive learning method is excessively influenced ^ or ™ les &r highspeed operation in accordances* 

by "noise due to the operation for processing [a suddenly] a of the input graph and are then output. L and W are 

incoming electronic mail" [excessively] when the operation * earch P™meteis and proper numbers are input therefor, 

history of the computer is analyzed and as a result, it cannot 50 The most important operation in the algorithm is pattern 

be put into practice. Further, the pure deductive learning modification. A temporary pattern in the pattern modifica- 

requires a knowledge base capable of analyzing all opera- u°n is a pattern constituted by two connected nodes in a 

tions or user's intention and cannot be practically realized contracted graph. An inversely contracted pattern of the 

either, temporary pattern is a pattern corresponding to the tempo- 

55 rary pattern in the input graph Gin before contraction. When 

SUMMARY OF THE INVENTION one or bolh of me nodes of the temporary pattern are 

It is an object of the present invention to provide a contracted nodes and correspond to a pattern composed of a 

learning method for attaining two learning functions within plurality of nodes in the input graph Gin, the inversely 

the same program to solve a problem. contracted pattern is a complicated pattern. The pattern 

The object is achieved by a machine learning method go modification is repeated as a parameter i in the algorithm, is 

including the steps of analyzing an input graph, extracting a increased, and therefore a complicated pattern is extracted, 

set of patterns frequently appearing in the graph, evaluating When the temporary pattern is taken out by the pattern 

the extracted pattern based on the resulting graph size modification, only patterns having the information of muru- 

reduction, and outputting a pattern having a good evaluation ally exclusive events, that is, class information attached to 

result. 65 the root node are extracted as the pattern. Thus, as described 

Data desired to be classified is converted into a graph and later, the taken-out pattern can be interpreted as a classifi- 

is then input. The course of a process by a computer is also cation rule. 
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FIG. 2 shows an example for explaining the course of 
processes of the algorithm shown in FIG. 1. In FIG. 2, the 
numbers 1-13 in the "contracted result of graph pattern" 
column are the graph color which corresponds to pattern 
information. The mark "P" and "N" are the graph color 
which corresponds to class information. How to make these 
color and graph is explained later 5 and 1 are input as L and 
W. Respectively, and as the evaluation method C the fol- 
lowing expression is assumed to be designated with respect 
to the contracted graph. In the designation of the evaluation 
method, a numerical expression including some data relative 
to the properties of a graph such as the number of nodes and 
the number of links is designated and the program realizing 
the algorithm evaluates the numerical expression actually to 
select the expression having a large value, although the 
expression having a small value may be selected depending 
on the nature of problems. Description is made by using the 
following designation. 



Evaluation Expression 

o Number of Nodes + Sum Total of values calculated 
by f concerning color expressing pattern 
information in graph 
f (color) 

= Evaluation Expression Number of Nodes + Sum 
Total of ffPattem) (I) f(Pattem) - max (1, Square 
of Number of Classes for the nodes which have the 
Pattern) 



10 



15 



When i is 0, Vtmp is empty and accordingly the con- 
tracted graph is Gin (input graph) itself. At this time, in FIG. 
2, the number of nodes in the contracted graph is 20, and the 
value of the first term of equation (1) is 20. For each pattern 
information in the contracted graph, the number of the 
classes for the nodes which have the pattern information are 
calculated, and the square of the numbers are summarized as 
the value of the second term of equation(2). In "i-0" row of 



FIG. 2, 4 graph nodes have color 1 (pattern 1), and 2 of them 

also have color P (i.e. class P) and remaining 2 have color 4 q of the first term of equation (1) is 12. In 



nodes have color 9 (pattern 9), and 2 of them also have color 
P (i.e. class P) and remaining 2 have color N (i.e. class N). 
Thus f(pattern 9) is 4 (square of 2, P and N). In "i=l" row 
of FIG. 2, 4 graph nodes have color 3, and none of them have 
class information ("P" or "N"), thus f(pattern 2) is 1 
(maximum of 1 and 0). Similarly, f(pattern 4) is 1, f(pattern 
5) is 1, f (pattern 6) is 1, f(pattern 7) is 1, f(pattern 8) is 1. 
Thus the sum of f(pattern) is 10, and the value of equation 
(1) is 26(-16+0). In this case, as the temporary patterns, "9, 
3", "9, 4", "9, 5", "9, 6", "9, 7" and "9, 8" are extracted and 
these patterns become the inversely contracted patterns "1, 
2, 3", "1, 2, 4", "1, 2, 5", "1, 2, 6", "1, 2, T and "1, 2, 8". 
Further, it is assumed that the view including two patterns 
"1, 2, 5" and "1, 2" are selected. 

When i is 2, Vtmp includes the patterns "1,2, 5" and "1, 
2". The patterns are assumed to be named as view nodes 10 
and 9, respectively. At this time, in FIG. 2, the number of 
nodes in the contracted graph is 14, and the value of the first 
term of equation (1) is 14. In "i=2" row of FIG. 1 2, 2 graph 
20 nodes have color 9 (pattern 9), and all of them also have 
color N (i.e. Class N). Thus f(pattern 9) is 19 square of 1, N). 
In "i=2" row of FIG. 2, 2 graph nodes have color 10 (pattern 
10), and all of them also have color P (i.e. class P). Thus 
f(pattern 10) is 1 (square of 1, P). In "i«2" row of FIG. 2, 4 
graph nodes have color 3, and none of them has class 
information ("P" or "N"), thus f(pattern 3) is 1 (maximum 
of 1 and 0). Similarly, f(pattern 4) is 1, f(pattern 6) is 1, 
f(pattern 7) is 1, f(pattern 8) is 1. Thus the sum of f(pattern) 
is 7, and the value of equation (1) is 21(=14+7). In this case, 
as the temporary patterns, "10, 3", "10, 4", "10, 6", "9, 3 " 
, "9, 4", "9, T and "9, 8" are extracted and these patterns 
become the inversely contracted patterns "1, 2, 5, 3", "1, 2, 
5, 4", "1, 2, 5, 6", "1, 2, 3", "1, 2, 4", "1, 2, 7" and "1, 2, 
8". Further, the view including two patterns "1,2,5, 3" and 
"1, 2" is assumed to be selected. 

When i is 3, Vtmp includes the patterns "1, 2, 5, 3" and 
"1, 2". These patterns are assumed to be named as view 
nodes 11 and 9, respectively. At this time, in FIG. 2, the 
number of nodes in the contracted graph is 12, and the value 

i=3" row of FIG. 



25 



30 



35 



N (i.e. class N). Thus f(pattern 1) is 4(square of 2, P and N). 2,2 graph nodes have color 9 (pattern 9), and all of them also 

In "i-0" row of FIG. 2, 4 graph nodes have color 2, and none have color N (i.e. class N), Thus f(pattern 9) is 1 (square of 

of them has class information ("P" or "N"), thus f(pattern 2) 1, N). In "i=3" row of FIG. 2, 2 graph nodes have color 11 

is l(maximum of 1 and 0). Similarly, f(pattern 3) is 1, (pattern 11), and all of them also have color P (i.e. class P). 

f(pattern 4) is 1, f(pattern 5) is fl, f(pattern 6) is 1, f(pattern 45 Thus f(pattern 11) is 1 (square of 1, P). In "i-3" row of FIG. 

7) is 1, f(pattern 8) is 1. Thus the sum of f(pattern) is 11, and 2, 2 graph nodes have color 3, and none of them has class 

the value of equation (1) is 31 (=20+11). Accordingly, since information ("P" or "N"), thus f(pattern 3) is 1 (maximum 

f=l for the pattern information 2 to 8, giving 7 in total and of 1 and 0). Similarly, f(pattern 4) is 1, f(pattern 6) is 1, 

the pattern information 1 has two P and N, which gives f=4 f(pattern 7) is 1, f(pattern 8) is 1. Thus the sum of f(pattern) 

and the evaluated value is 31 in total. In this case, "1, 2", "1, 50 is 7, and the value of equation (1) is 19 (=12+7). In this case, 

4", "1, 5", "1, 6", "1, T and "1, 8" are extracted as the the temporary patterns, "11, 4", "11, 6", "9, 3", "9, 4", "9, 

temporary patterns and these are coincident with the 7" and "9, 8" are extracted and these patterns become the 

inversely contracted pattern. Further, it is assumed that a inversely contracted patterns "1 , 2, 5, 3, 4", "1,2, 5, 3, 6", 

view including only one pattern "1, 2" is selected. This "1, 2,3", "1, 2, 4", "1, 2, 7" and "1, 2, 8" . Further, the view 

selection is based on the value of equation (1). The view (i.e. 55 including two patterns "1,2, 5, 3" and "1,2,3, 4" is assumed 

combination of patterns) which results better value of equa- to be selected. 

tion (1) is selected, and the assumption "a view including When i is 4, Vtmp includes the patterns "1, 2, 5, 3" and 

only one pattern 1,2 is selected" is for the explanation. «i ) 2, 3". These patterns are assumed to be named as view 

At this time, the pattern "2, 3" is also considered as the nodes 11 and 12, respectively. At this time, in FIG, 2, the 

pattern having two nodes, while both of the nodes have no 60 number of nodes in the contracted graph is 10, and the value 

class information of the mutually exclusive events P and N of the first term of equation (1) is 10. In "=4" row of FIG. 

and accordingly extraction of the pattern is not made. 2, 2 graph nodes have color 12 (pattern 12), and all of them 

When i is 1, Vtmp includes only the pattern "1, 2". This also have color N (i.e. class N). Thus f(pattern 12) is 1 

is assumed to be named as a view 9. The contracted graph (square of 1,N). In "i=4" row of FIG. 2, 2 graph nodes have 

is different from Gin. At this time, in FIG. 2, the number of 65 color 11 (pattern 11), and all of them also have color P (i.e. 

nodes in the contracted graph is 16, and the value of the first class P). Thus f(pattern 11) is 1 (square of 1,P). In "i=4" row 

term of equation (1) is 16. In "i-1" row of FIG. 2, 4 graph of FIG. 2, 3 graph nodes have color 4 and none of them has 
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class information ("P" or "N"), thus f(pattern 4) is 1 
(maximum of 1 and 0). Similarly, f(pattern 6) is 1, f(pattern 

7) is 1, f(pattern 8) is 1 . Thus the sum of f (pat tern) is 6, and 
the value of equation (1) is 16(=10+6). In this case, as the 
temporary patterns, "11, 4", "11, 6", "12, 4", "12, 7" and 5 
"12, 8" are extracted and these patterns become the inversely 
contracted patterns "1, 2, 5, 3, 4", "1, 2, 5, 3, 6", "1 , 2, 3, 
4", "1, 2, 3, T and "1, 2, 3, Further, it is assumed that 
the view including two patterns "1, 2, 5, 3" and "1, 2, 3, 4" 

is selected. 

When i is 5, Vtmp includes the patterns "1, 2, 5, 3" and 
"1, 2, 3, 4". These patterns are assumed to be nodes 11 and 
13, respectively. At this time, in FIG. 2, the number of nodes 
in the contracted graph is 8, and the value of the first term 
of equation (1) is 8. In "i-5" row of FIG. 2, 2 graph nodes 15 
have color 13 (pattern 13), and all of them also have color 
N (i.e. class N). Thus f(pattern 13) is 1 (square of 1, N). In 
"i.-5" row of FIG. 2, 2 graph nodes have color 11 (pattern 
11), and all of them also have color P (i.e. class P). Thus 
f(pattern 11) is 1 (square of 1,P). In "i=5" row of FIG. 2, 1 2Q 
graph node has color 4, and it does not have class informa- 
tion ("P" or "N"), thus f (pattern 4) is 1 (maximum of 1) and 
0). Similarly, f (pattern 6) is 1, f (pattern 7) is 1, f (pattern 

8) is 1. Thus the sum of f(pattern) is 6, and the value of 
equation (1) is 14(=8+6). ^ 

FIG. 3 illustrates the method of converting data desired to 
be classified into a graph. From the nucleotide sequence of 
DNA, it is assumed that the classification rule as to whether 
the sequence is a promoter or not is inductively from data. 
In this case, Gin is a set of small graphs each corresponding 30 
to the DNA sequence. The individual small graph is a graph 
of a depth 1 constituted by one root node and end node 
number of which is equal to the length of the DNAsequence. 
Each graph node has the information about pattern and class 
as node color. In FIG. 3, the root node of each graph has a 35 
color that corresponds to the class information "positive" or 
"negative". Each leaf nodes has a color that corresponds to 
the nucleotide, i.e. A,T, C and G, in the DNA sequence. 

The pattern information of the root node has always 1 and 
the class information has identification information of the 40 
positive example and/or the negative example. The pattern 
information of the leaf node has the nucleotide information 
of the DNA sequence and has no class information. 

After the data has been converted into the graph by means 
of the above method, the patterns are extracted by the 45 
algorithm of FIG. 1. FIG. 4 illustrates the method of 
converting an extracted pattern into a rule for classification. 
If the class information of the root node of the extracted 
pattern is made to be a conclusion of the classification rule 
and the pattern information of the leaf nodes is made to be 50 
a condition, this forms [a] classification rules [can be] for 
each pattern. 

FIG. 5 illustrates the method of converting the course of 
the process by the computer into the graph. In this case, the 
graph can be regarded as a kind of proof tree. Each node 55 
corresponds to the (intermediate) conclusion appearing in 
the inference process. The inference rule utilized to obtain 
the result is stored as the pattern information. As the class 
information, the "positive/negative" information is stored 
depending on whether the inference has been successful by 60 
selection of an alternative plan which is provided for selec- 
tion when a plurality of inference rules are applicable. When 
there is no alternative plan, the "positive" may be stored, 
although there is no problem even if the class information is 
not stored. Further, information indicative of the order of 65 
branch in the node is attached to indicate information of the 
unification. 



The information of the unification to be stored is different 
depending on the used inference system. For example, if the 
original calculation process is made by prolog, information 
indicative of the order of term in the body portion is stored 
as information indicative of the order of the branch. Since 
the prolog process is a general proof process, information to 
be stored can be selected with reference to an example of 
prolog even if the inference system other than the prolog is 
utilized. 

In the above method, after the course of process by the 
computer has been converted into a graph, the pattern can be 
extracted by the algorithm of FIG. 1. FIG. 6 illustrates the 
method of converting the pattern into the rule for high-speed 
operation. If the pattern information of the root node having 
"positive" as the class information of the root portion, of the 
extracted patterns is made to be the conclusion and the 
pattern information of the leaf nodes is made to be the 
condition, the rule for high-speed operation with which the 
inference succeeds can be prepared from each pattern. The 
rule with which the inference fails can be extracted from the 
pattern having "negative" as the class information of the root 
node. 

In the embodiment, in order to demonstrate the simple 
realization method on the computer, the algorithm for evalu- 
ating the extracted pattern every time has been described. 
This method has a defect in that the processing speed is slow 
although the realization is easy. This defect can be removed 
to attain the high-speed processing by substituting the 
method described below for evaluation of the combination 
of the pattern stored in the variable Bnext in the algorithm 
of FIG. 1. 

The combination of the pattern stored in the variable B is 
first actually evaluated in accordance with the above evalu- 
ation method. Then, when the combination of the pattern of 
Bnext is evaluated, the substitute evaluation result defined 
by the following expression is used for the combination of 
the pattern registered by the pattern modification of the 
combination V of the pattern stored in the variable B. 

Evaluation Expression 

= Evaluation result of V before modification 

- the number of the temporary patterns extracted 
pattern modification 

+ g 
+ h 

- f (color of root of the temporary pattern) 

The following steps calculates the value of g: 

1. Make a new graph G new by removing the temporary 
pattern from G (M . 

2. Calculate f for the pattern color of the root node of the 
temporal pattern for G ww ,. 

3. Assign the result to the value of g. 

The following steps calculate ;the value of h: 

1. Make a new graph G 9UW by extracting the temporary 
pattern only from G^. In this case, G new is made up of 
the temporary pattern in G^. 

2. Calculate f for the pattern color of the root node of the 
temporal pattern for G^„. 

3. Assign the result to the value of h. Calculate f for the 
pattern color of the root node of the temporal pattern for 

The second term of the evaluation expression is a predic- 
tion value of the reduced number of nodes (first term of the 
original evaluation expression), and the third and subsequent 
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terms are prediction values of the reduced number in the 
second term of the original evaluation expression and may 
be a different expression if it represents the tendency thereof. 

An average of the evaluated results of Vtmpl and Vtmp2 
is used as the substitute evaluation result for the combination S 
of the patterns registered by the merging operation of the 
combination of Vtmpl and Vtmp2 of the pattern stored in 
the variable B. 

Usually, since the combination of the patterns stored in 
the variable B is smaller in number than the combination of 10 
the patterns stored in the variable Bnext, the number of times 
of the evaluation of the pattern combination can be reduced 
by the substitute evaluation method. The evaluation requires 
the contraction process of the graph, because the number of 
times of the contraction process requiring the processing 15 
time can be reduced by reducing the number of times of 
evaluation, the high-speed operation can be attained as the 
whole algorithm. 

The similar algorithm is utilized in the above mentioned 
paper "Concept Learning from Inference Pattern by Kentichi 20 
Yoshida and Hiroshi Motoda, Journal of Artificial Intelli- 
gence Society of Japan, pp. 119-129, July 1992, while this 
paper does not consider information of the class to be 
classified, success or failure of the process and the mutually 
exclusive event. Further, in the learning method described in 25 
the above paper, the data desired to be classified can not be 
converted into a graph to be input and the pattern cannot be 
converted into the rule for classification. 

Further, the present invention can be expanded to prepare 
the program utilizing the learning result. FIGS. 7A and 7B 30 
show learning of operation history of a computer as an 
example for explaining this feature, and an example of 
partial preparation of a program by this. 

FIGS. 7A and 7B show examples of the operation histo- 
ries of a computer for preparation of a document. The user 35 
of the computer uses the LaTeX (main program for analyz- 
ing the file in which a document is stored) as a formatting 
system, the bibtex (program for searching for references 
from output of the LaTeX) as a reference searching system 
and the dvi2ps (program for preparing a printing command 40 
from output of the LaTeX) and the ghostview (program for 
analyzing the printing command to display it on a screen) as 
a display system of the formatted of document a fair copy. 
In the history (a), LaTeX command is in plural times in order 
to process the references from the specification of the LaTeX 45 
and the bibtex. 

It is considered that the operation shown in FIG. 7A is 
divided into three commands "searching operation of refer- 
ences" of the first half and three commands "formatting and 
display operation" of the second half. In the initial stage of 50 
preparation of the document, the whole process shown in 
FIG. 7A has been performed every time, while a reference 
is not changed very much in the proofreading stage of the 
second half and accordingly only steps of the second half 
may be executed (FIG. 7B). 55 

FIGS. 8a and 8B show examples of graph representation 
for analyzing the operation histories in the present invention. 
If such graphs which are a collection, in order of operation, 
of programs used in accordance with operation of the user is 
analyzed by the algorithm of FIG. 1, "searching operation of 60 
references", "document formatting and display operation" 
and the like can be learned as a macro operation. Further, 
whether the operation history of the user is coincident with 
the first half of the learned macro operation or not is 
examined, and when it is coincident, the operation of the 65 
second half can be performed automatically. That is, by 
performing the operation of the first half by the user, the 
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program for performing the operation of the second half can 
be prepared and the user can perform the operation of the 
second half without designation of the next command by the 
user. 

FIG. 9 illustrates an operation supporting system for 
implementing the process procedure of the present inven- 
tion. Particularly, as shown in FIG. 9 a user inputs to the 
operation support system 12 various commands to cause the 
execution of various programs as illustrated in FIG. 7Aand 
7B. These programs, for example, LATEX causes various 
analyzing operations to be performed by the analyzing 
mechanism 14. The analyzing mechanism 14 performs the 
analyzing operations in accordance with the algorithm 
shown in FIG. 1. An indication of the user specified opera- 
tions is supplied to the computer 16, A mechanism is also 
provided in the operation supporting system 12 so as to 
evaluate extracted macro operations provided by the ana- 
lyzing mechanism 14, The mechanism 18 evaluates the 
extracted macro operations by determining whether the 
extracted macro operations satisfy a predetermined condi- 
tion and provides an evaluation result. Thereafter, patterns 
included in the extracted patterns are selected based upon 
the evaluation result. The selected pattern is then outputted. 
The computer 16 receives the output from the mechanism 
18. The computer 16 learns the operation performed by the 
user and automatically performs subsequent operations by 
inferring such subsequent operations from the learned opera- 
tions. 

As apparent form the above embodiment, according to the 
present invention, even if separate programs are required in 
the prior art, two learning functions can be provided by the 
same program. Furthermore, the program utilizing the learn- 
ing result can be prepared automatically. 

I claim: 

1. A method of learning in a computer, comprising the 
steps of: 

converting data desired to be processed into graph infor- 
mation; 

inputting said graph information representative of a 
graph; 

analyzing said graph information and extracting patterns 

appearing in said graph information; 
evaluating the extracted patterns to determine whether the 

extracted patterns satisfy a predetermined condition 

and outputting evaluation result information indicating 

a result of the evaluation; 
selecting patterns included in the extracted patterns based 

on the evaluation result information and outputting the 

selected patterns; and 
converting the selected patterns into a data processing rule 

to be output. 

2. A method of learning in a computer, comprising the 
steps of: 

converting data desired to be classified into graph infor- 
mation; 

inputting said graph information; 

analyzing said graph information and extracting patterns 

appearing in said graph information; 
evaluating the extracted patterns to determine whether the 

extracted patterns satisfy a predetermined condition 

and outputting evaluation result information indicating 

a result of the evaluation; 
selecting patterns included in the extracted patterns based 

on the evaluation result information and outputting the 

selected patterns; and 
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converting the selected patterns into a rule for classifica- 
tion to be output. 

3. A method of learning in a computer, comprising the 
steps of: 

converting the course of a process into graph information; 5 
inputting said graph; 

analyzing said graph information and extracting patterns 

appearing in said graph information; 
evaluating the extracted patterns to determine whether the 10 

extracted patterns satisfy a predetermined condition 

and outputting an evaluation result indicating result of 

the evaluation information; 
selecting patterns included in the extracted patterns based 

on the evaluation result information and outputting the 15 

selected patterns; 
converting the selected patterns into a rule for high-speed 

operation of a process to be output. 

4. A method of learning in a computer, comprising the 
steps of: 20 

converting data desired to be processed into graph infor- 
mation; 

inputting said graph information representative of a 

graph; 25 
analyzing said graph information and extracting patterns 

appearing in said graph information; 
evaluating the extracted patterns to determine whether the 

extracted patterns satisfy a predetermined condition 

and outputting evaluation result information indicating 30 

a result of the evaluation; 
selecting patterns included in the extracted patterns based 

on the evaluation result information and outputting the 

selected patterns; and 
converting the selected patterns into a data processing rule 35 

to be output; 

wherein information of a mutually exclusive event includ- 
ing success and failure of a process and a class to be 
classified is input together with the graph information. 4Q 

5. A method of learning according to claim 4, wherein 
extraction of the patterns is performed starting from a 
portion with the information of a mutually exclusive event 
in said graph information. 

6. A method of learning according to claim 4, wherein the 45 
step of evaluating the extracted patterns comprises the steps 
of: 

examining second graph information coincident to the 
extracted patterns and reducing the evaluation result 
information for the extracted patterns in the coincident 50 
graph information that correspond to a plurality of 
kinds of mutually exclusive events. 
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7. An operation supporting system for supporting the 
learning of operations by a computer, said operation system 
comprising: 

a mechanism for converting data desired to be classified 
into graph information representative of a graph to be 
input; 

a mechanism for inputting said graph information; 

a mechanism for analyzing said graph information and 
extracting a pattern appearing in said graph informa- 
tion; 

a mechanism for converting the extracted pattern into a 
rule for classification to be output; 

a mechanism for outputting an extracted pattern by deter- 
mining whether the extracted pattern satisfies a prede- 
termined condition and outputting evaluation result 
information indicating a result of the evaluation; and 

a mechanism for selecting a pattern included in the 
extracted pattern based on the evaluation result infor- 
mation and outputting the selected pattern; 

wherein operations performed by an user are learned by 
the computer and subsequent operations of the user are 
inferred from the learned operations and performed 
automatically. 

8. A method of learning in a computer, comprising the 
steps of: 

converting data desired to be classified and/or a course of 
a process to be performed by said computer into graph 
information; 

inputting said graph information representative of a 
graph; 

analyzing said graph information and extracting patterns 
appearing in said graph information, said extracted 
patterns include mutually exclusive events; 

evaluating the extracted patterns to determine whether the 
extracted patterns satisfy a predetermined condition 
and outputting evaluation result information indicating 
a result of the evaluation; 

selecting patterns included in the extracted patterns based 
on said evaluation result information and outputting the 
selected patterns; 

converting the selected patterns into a data processing rule 
to be output; and 

outputting said data processing rules including rules for 
classification or rules for high-speed operation which 
are converted from the selected patterns in accordance 
with a kind of input graph. 

* * * * * 
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